Modern data mining tools in descriptive sensory analysis: a case study with a Random Forest approach

Authors

  • P. M. Granitto
  • F. Gasperi
  • F. Biasioli
  • E. Trainotti
  • C. Furlanello
Abstract

In this paper we introduce Random Forest (RF) as a new modeling technique in the field of sensory analysis. As a case study we apply RF to the predictive discrimination of 6 typical cheeses of the Trentino province (North Italy) from data obtained by Quantitative Descriptive Analysis. The corresponding sensory profiling was carried out by 8 trained assessors using a developed language containing 35 attributes. We compare RF's discrimination capabilities with those of Linear Discriminant Analysis (LDA) and discriminant Partial Least Squares (dPLS). The RF models are more accurate, with smaller prediction errors than LDA and dPLS. RF also makes it possible to analyze the developed models graphically, using Multi Dimensional Scaling plots based on an internal measure of similarity between samples. We compare these plots with those obtained from Principal Component Analysis and LDA, finding that the same qualitative information can be extracted from all methods. The RF model also gives an estimate of the relative importance of each sensory attribute for the discriminant function. We couple this measure with an appropriate experimental setup in order to obtain an unbiased and stable method for variable selection. This method compares favorably with sequential selection based on LDA models.
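
The abstract does not define the "internal measure of similarity between samples". A minimal sketch, assuming it is the usual RF proximity (the fraction of trees in which two samples fall in the same terminal node), is given below. The function names and the leaf assignments are hypothetical and purely illustrative; they are not taken from the paper.

```python
from itertools import combinations


def proximity_matrix(leaf_ids):
    """Return the n x n proximity matrix for n samples.

    leaf_ids[i][t] is the index of the leaf that sample i reaches in tree t.
    Entry (i, j) is the fraction of trees in which samples i and j share a leaf.
    """
    n = len(leaf_ids)
    n_trees = len(leaf_ids[0])
    # A sample always shares a leaf with itself, so the diagonal is 1.
    prox = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i, j in combinations(range(n), 2):
        shared = sum(a == b for a, b in zip(leaf_ids[i], leaf_ids[j]))
        prox[i][j] = prox[j][i] = shared / n_trees
    return prox


def dissimilarity(prox):
    """Turn proximities into dissimilarities (1 - proximity) for use in MDS."""
    return [[1.0 - p for p in row] for row in prox]


# Hypothetical leaf assignments (assumption): 4 samples passed through 4 trees.
leaf_ids = [
    [0, 2, 1, 3],
    [0, 2, 1, 0],
    [1, 0, 1, 2],
    [1, 0, 2, 2],
]

prox = proximity_matrix(leaf_ids)
dist = dissimilarity(prox)
```

In practice the leaf indices would come from a fitted forest (for example, scikit-learn's `RandomForestClassifier.apply` returns one leaf index per sample and tree), and the dissimilarity matrix would be passed to a Multi Dimensional Scaling routine that accepts precomputed dissimilarities.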

Similar resources

Design Method of Railway Transport Stations in Tehran with the Approach of Sensory Architecture (Case Study: Tajrish, Valiasr Square and Mehrabad Airport metro stations)

This article tries to examine the relationship between the components of location creation in stationary rail transport spaces with sensory richness as the most important component of sensory architecture. The research method was descriptive-analytical, in the context of the survey and using library studies and obtaining the opinion of experts by questionnaire and interview, which according to ...

Sensory analysis in the food industry as a tool for marketing decisions

In the food industry, sensory analysis can be useful to direct marketing decisions concerning not only products, for example product positioning with respect to competitors, but also market segmentation, customer relationship management, advertising strategies and price policies. In this paper we show how interesting information useful for marketing management can be obtained by combining the r...

Using Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach

Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...

Town trip forecasting based on data mining techniques

In this paper, a data mining approach is proposed for duration prediction of the town trips (travel time) in New York City. In this regard, at first, two novel approaches, including a mathematical and a statistical approach, are proposed for grouping categorical variables with a huge number of levels. The proposed approaches work based on the cost matrix generated by repetitive post-hoc tests f...

A Case Study of Random Forest in Predictive Data Mining

The paper examines the potential of a novel data mining method, the random forest classifier, to support managerial decision making in complex forecasting applications. A modelling paradigm is proposed that embraces a learning curve analysis and grid-search to analyse the model’s sensitivity towards the number of training examples and parameter settings, respectively, and, eventually, produce a...

Publication date: 2006